(54) TRANSCODE METHOD BETWEEN VIDEO STREAMS, AND TRANSCODER, AND RECORDING MEDIUM WITH
PROGRAM FOR THE TRANSCODE METHOD RECORDED THEREON

(57)Abstract 

PROBLEM TO BE SOLVED: To re-encode to a stream of a different kind while reducing the
transcode processing time.
SOLUTION: A variable length decoding section 2 decodes an input video stream 1 that was
compression-coded by a combination of motion compensation and transform coding, and
extracts syntax parameters from the stream 1. An input inverse quantization section 3
applies inverse quantization to the transform coefficients of each block of the video
stream; a pre-processing transform section 4 and a spatial down-sample section 5 obtain
transform coefficients that convert the video stream into frame images with a different
image size or spatial resolution; an output quantization section 6 quantizes the transform
coefficients again; and a variable length coding section 7 applies variable length coding
to the output of the quantization section 6. Data from a syntax transform section 10 are
supplied to the variable length coding section 7, where they are combined with the
transform coefficients output from the output quantization section 6, and the variable
length coding section 7 finally generates and outputs an output video stream.

* NOTICES *

Japan Patent Office is not responsible for any 
damages caused by the use of this translation. 

1. This document has been translated by computer, so the translation may not reflect the
original precisely.

2. **** indicates a word that could not be translated.
3. Words in the drawings are not translated.



CLAIMS 



[Claim(s)] 

[Claim 1] A method of transcoding between video streams, in which an input video stream
compression-coded by a hybrid coding scheme combining motion compensation and transform coding
is converted into, and output as, another video stream having a different syntax, the method
characterized by: extracting from the input video stream the syntax parameters required for
transcoding, including motion vector information; converting the extracted parameters to conform
to the syntax of the output stream, and saving them; reducing temporal resolution, where needed,
by dropping bidirectionally predicted frames from the input video stream, without applying an
inverse transform to the transform coefficients; reducing spatial resolution by reducing the
image size of the video stream whose temporal resolution has been reduced, without applying an
inverse transform to the transform coefficients; and generating and outputting, under the correct
syntax, a video stream whose spatial resolution or temporal resolution differs from that of the
input, from the transform coefficient information thus obtained and the saved syntax parameter
information.
[Claim 2] The method of transcoding between video streams according to claim 1, characterized in
that image degradation of the video stream whose temporal and spatial resolution have been
reduced is corrected without applying an inverse transform to the transform coefficients.
[Claim 3] A transcoder that converts an input video stream, compression-coded by a hybrid coding
scheme combining motion compensation and transform coding, into another video stream having a
different syntax and outputs it, the transcoder characterized by comprising: means for extracting
from the input video stream the syntax parameters required for transcoding, including motion
vector information; means for converting the extracted parameters to conform to the syntax of the
output stream, and saving them; means for reducing temporal resolution, where needed, by dropping
bidirectionally predicted frames from the input video stream, without applying an inverse
transform to the transform coefficients; means for reducing spatial resolution by reducing the
image size of the video stream whose temporal resolution has been reduced, without applying an
inverse transform to the transform coefficients; and means for generating and outputting, under
the correct syntax, a video stream whose spatial resolution or temporal resolution differs from
that of the input, from the transform coefficient information thus obtained and the saved syntax
parameter information.

[Claim 4] The transcoder according to claim 1, characterized by comprising means for correcting
image degradation of the video stream whose temporal and spatial resolution have been reduced,
without applying an inverse transform to the transform coefficients.

[Claim 5] A recording medium on which is recorded a program of a transcoding method in which an
input video stream compression-coded by a hybrid coding scheme combining motion compensation and
transform coding is converted into, and output as, another video stream having a different
syntax, characterized in that a program for causing a computer to execute the following processes
is recorded on a recording medium readable by the computer: a process of extracting from the
input video stream the syntax parameters required for transcoding, including motion vector
information; a process of converting the extracted parameters to conform to the syntax of the
output stream, and saving them; a process of reducing temporal resolution, where needed, by
dropping bidirectionally predicted frames from the input video stream, without applying an
inverse transform to the transform coefficients; a process of reducing spatial resolution by
reducing the image size of the video stream whose temporal resolution has been reduced, without
applying an inverse transform to the transform coefficients; and a process of generating and
outputting, under the correct syntax, a video stream whose spatial resolution or temporal
resolution differs from that of the input, from the transform coefficient information thus
obtained and the saved syntax parameter information.

[Claim 6] The recording medium on which a program of a transcoding method is recorded according
to claim 5, characterized by including a process of correcting image degradation of the video
stream whose temporal and spatial resolution have been reduced, without applying an inverse
transform to the transform coefficients.



DETAILED DESCRIPTION



[Detailed Description of the Invention] 
[0001] 

[Technical Field of the Invention] This invention belongs to the field of video coding
technology, and relates to a method and a transcoder for re-encoding (transcoding) video data
already encoded at a certain bit rate into video data of a lower bit rate with different temporal
and spatial resolution.
[0002] 

[Description of the Prior Art] Precedents for methods of re-encoding video data that was
compression-coded by a hybrid coding scheme combining motion compensation and transform coding
into low-bit-rate video data with different temporal and spatial resolution include reference
(1): Susie J. Wee, John G. Apostolopoulos, Nick Feamster, "Field-to-Frame Transcoding with
Spatial and Temporal Downsampling", ICIP'99, 1999.

[0003] On the other hand, precedents for methods that re-encode video data without applying an
inverse transform to the transform coefficients include reference (2): Nishimura **, Hiroyuki
Kasai, Kazuyuki Takaya, Kameyama **, Naoaki Sakaki, Hanamura **, Tominaga ****, "Development of
real-time MPEG-2 video transcoder software", ********, vol. AVM 25-5, pp. 25-30, 1999; and
reference (3): Pedro A. A. Assuncao, Mohammed Ghanbari, "A Frequency-Domain Video Transcoder for
Dynamic Bit-Rate Reduction of MPEG-2 Bit Streams", IEEE Trans. CSVT, vol. 8, pp. 953-967,
Dec. 1998.
[0004] 

[Problem(s) to be Solved by the Invention] In method (1) above, to re-encode to video data with
different temporal and spatial resolution, the compressed video data is first decompressed by an
inverse transform, the image size is reduced in the pixel domain, and coding by motion
compensation and transform coding is then performed again. Because of this redundant
configuration, the transcoding processing time increases.
[0005] Moreover, the proposed conversion methods without an inverse transform ((2) and (3)
above), which aim to shorten the processing time of such transcoding, are limited in application
to transcoding between streams of the same kind, such as MPEG-2 to MPEG-2, in which the temporal
and spatial resolution do not change. Nothing has been proposed for re-encoding to a stream of a
different kind accompanied by a reduction of temporal or spatial resolution.

[0006] The purpose of this invention is to provide a transcoding method and a transcoder, as well
as a recording medium on which a program for the transcoding method is recorded, that shorten the
transcoding processing time and can re-encode to a stream of a different kind.
[0007] 

[Means for Solving the Problem] To solve the above problem, this invention uses the means
described below.
[0008] First, a functional-block group is provided that merges and shrinks the transform
coefficients, which were transformed block by block, without performing an inverse transform.
Image data input to the apparatus is fed to this functional-block group as transform coefficient
blocks, after variable-length decoding and inverse quantization. Here, spatial resolution
reduction of the whole image is achieved by generating a new block that merges two or more
adjacent blocks, without reducing the block size. The image data whose spatial resolution has
been reduced by this functional-block group (******) is re-quantized (usually more coarsely)
according to the target bit rate, and the video stream finally output from the apparatus is
created by adding and combining the required syntax data.

[0009] Because no inverse transform of the transform coefficients is ever performed in this
processing, the total processing time is shortened.

[0010] Moreover, since re-quantization generally cannot avoid image degradation caused by drift
error, this invention allows a feedback loop that corrects the drift error on frequency-domain
data to be added as needed. This feedback loop contains a functional-block group that performs
motion compensation on frequency-domain data, and improves image quality after transcoding.
[0011] From the above, this invention is characterized by the following transcoding method and
transcoder, as well as a recording medium on which the transcoding method is recorded.
[0012] (Invention of a transcoding method) A transcoding method in which an input video stream
compression-coded by a hybrid coding scheme combining motion compensation and transform coding
is converted into, and output as, another video stream having a different syntax. The syntax
parameters required for transcoding, including motion vector information, are extracted from the
input video stream. The extracted parameters are converted to conform to the syntax of the output
stream and saved. Where needed, bidirectionally predicted frames are dropped from the input video
stream to reduce temporal resolution, without applying an inverse transform to the transform
coefficients. The image size of the video stream whose temporal resolution has been reduced is
then reduced to lower the spatial resolution, again without applying an inverse transform to the
transform coefficients. The method is characterized by generating and outputting, under the
correct syntax, a video stream whose spatial resolution or temporal resolution differs from that
of the input, from the transform coefficient information thus obtained and the saved syntax
parameter information.

[0013] Moreover, the method is characterized by correcting image degradation of the video stream
whose temporal and spatial resolution have been reduced, without applying an inverse transform to
the transform coefficients.

[0014] (Invention of a transcoder) A transcoder that converts an input video stream,
compression-coded by a hybrid coding scheme combining motion compensation and transform coding,
into another video stream having a different syntax and outputs it, characterized by comprising:
means for extracting from the input video stream the syntax parameters required for transcoding,
including motion vector information; means for converting the extracted parameters to conform to
the syntax of the output stream, and saving them; means for reducing temporal resolution, where
needed, by dropping bidirectionally predicted frames from the input video stream, without
applying an inverse transform to the transform coefficients; means for reducing spatial
resolution by reducing the image size of the video stream whose temporal resolution has been
reduced, without applying an inverse transform to the transform coefficients; and means for
generating and outputting, under the correct syntax, a video stream whose spatial resolution or
temporal resolution differs from that of the input, from the transform coefficient information
thus obtained and the saved syntax parameter information.

[0015] Moreover, the transcoder is characterized by comprising means for correcting image
degradation of the video stream whose temporal and spatial resolution have been reduced, without
applying an inverse transform to the transform coefficients.

[0016] (Invention of a recording medium) A recording medium on which is recorded a program of a
transcoding method in which an input video stream compression-coded by a hybrid coding scheme
combining motion compensation and transform coding is converted into, and output as, another
video stream having a different syntax, characterized in that a program for causing a computer
to execute the following processes is recorded on a recording medium readable by the computer: a
process of extracting from the input video stream the syntax parameters required for
transcoding, including motion vector information; a process of converting the extracted
parameters to conform to the syntax of the output stream, and saving them; a process of reducing
temporal resolution, where needed, by dropping bidirectionally predicted frames from the input
video stream, without applying an inverse transform to the transform coefficients; a process of
reducing spatial resolution by reducing the image size of the video stream whose temporal
resolution has been reduced, without applying an inverse transform to the transform
coefficients; and a process of generating and outputting, under the correct syntax, a video
stream whose spatial resolution or temporal resolution differs from that of the input, from the
transform coefficient information thus obtained and the saved syntax parameter information.

[0017] Moreover, the recording medium is characterized by including a process of correcting image
degradation of the video stream whose temporal and spatial resolution have been reduced, without
applying an inverse transform to the transform coefficients.
[0018] 

[Embodiment of the Invention] Drawing 1 is a block diagram of a transcoder according to an
embodiment of this invention, without a drift error correction function. The input to this
apparatus is the input video stream 1, compression-coded by a hybrid coding scheme combining
motion compensation and transform coding. Concrete examples of such streams are MPEG-2 and
MPEG-1, in which the discrete cosine transform (DCT) is used for transform coding.

[0019] Besides these standard schemes, the input video stream 1 also covers all proprietary
schemes that combine motion compensation with another transform coding method (discrete sine
transform (DST), Hadamard transform, Haar transform, slant transform, KL transform).
[0020] The output of this apparatus is the output video stream 8, whose image size and frame rate
are reduced relative to the input video stream. Concrete examples include MPEG-4, MPEG-2, MPEG-1,
H.261 and H.263. A proprietary format based on the same variety of transforms as the input is
also included in the output video stream 8.
[0021] The flow of transcoding in this embodiment is as follows. The input video stream 1 is
first variable-length decoded in the variable-length decoding section 2. The syntax data 9 needed
to generate the output stream is also extracted from the input stream. The picture type of each
frame is identified here, and when the frame rate needs to be reduced, bidirectionally predicted
frames (B-pictures) can be dropped. The resulting video stream is input to the input-side inverse
quantization section 3, where the transform coefficients of each block are inverse quantized.
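
The frame-rate reduction step above can be sketched as follows. This is a minimal illustration, not the apparatus itself: the `Picture` record and the `drop_b_pictures` function are hypothetical names introduced here. It relies on the fact that in MPEG-1 and MPEG-2 no other picture is predicted from a B-picture, so B-pictures can be discarded without touching the remaining I- and P-pictures.

```python
from dataclasses import dataclass


@dataclass
class Picture:
    index: int         # position in the stream (hypothetical field)
    picture_type: str  # "I", "P" or "B", as identified after variable-length decoding


def drop_b_pictures(pictures, reduce_frame_rate):
    """Return the pictures that are passed on to inverse quantization."""
    if not reduce_frame_rate:
        return list(pictures)
    # B-pictures are never used as references, so removing them leaves
    # the prediction chain of the I- and P-pictures intact.
    return [p for p in pictures if p.picture_type != "B"]


gop = [Picture(i, t) for i, t in enumerate("IBBPBBPBBPBB")]
kept = drop_b_pictures(gop, reduce_frame_rate=True)
print([p.picture_type for p in kept])  # ['I', 'P', 'P', 'P']
```

With this example group of pictures the frame rate falls to one third, since two of every three pictures are B-pictures.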

[0022] Next, the inverse-quantized transform coefficients of each block are input to
functional-block group A, which reduces the image size while the data remains in the transform
domain. Functional-block group A is divided into the pre-processing transform section 4 and the
spatial down-sample section 5. The pre-processing transform section 4 varies somewhat with the
kind of input video stream. Taking as an example the case where MPEG-2 field pictures are
converted into frame pictures such as H.263, H.261 and MPEG-4, the pre-processing transform
section 4 first converts coefficients coded in frame DCT mode into field DCT coefficients. To
obtain the transform coefficients of the block corresponding to the bottom field from the
vertically adjacent blocks B_1j and B_2j (j = 1, 2) within each macroblock, the following
formula (1) is applied.
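[0023]
[Equation 1]
B_j^{bottom} = DCT(f_1) \cdot B_{1j} + DCT(f_2) \cdot B_{2j} \quad (j = 1, 2) \qquad (1)

f_2 = \begin{pmatrix}
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\
0 & 0 & 0 & 0 & 0 & 0 & 0 & 1
\end{pmatrix}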

[0024] Here, the function DCT(f) applied to the blocks f_1 and f_2 denotes the two-dimensional
DCT of an 8x8 block, and is specifically defined by the following formulas.
[0025] 
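[Equation 2]

DCT(f) = (1/2) \cdot A \cdot f \cdot A^{T} \qquad (2)

where A_{m,n} = (1/2) \cdot k_m \cdot \cos\left[ m (n + 1/2) \pi / 8 \right] \quad (m, n = 0, 1, 2, \dots, 7)

k_n = \begin{cases} 1/\sqrt{2} & (n = 0) \\ 1 & (n = 1, 2, \dots, 7) \end{cases}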

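The frame-to-field conversion of formula (1) can be checked numerically with the sketch below. It is an illustration under stated assumptions, not the patented implementation. It assumes the orthonormal convention DCT(X) = A X A^T, so that DCT(f b) = DCT(f) DCT(b) holds exactly; any extra constant scale factor is left out. It also assumes that f_1 is the counterpart of f_2 that places rows 1, 3, 5, 7 of the upper block in rows 0 to 3. The helper names are hypothetical.

```python
import math

N = 8


def matmul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(N)) for j in range(N)] for i in range(N)]


def transpose(x):
    return [list(row) for row in zip(*x)]


def add(x, y):
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(x, y)]


# A[m][n] = (1/2) * k_m * cos(m * (n + 1/2) * pi / 8); this matrix is orthonormal.
A = [[0.5 * (1 / math.sqrt(2) if m == 0 else 1.0) * math.cos(m * (n + 0.5) * math.pi / N)
      for n in range(N)] for m in range(N)]
AT = transpose(A)


def dct2(block):
    """2-D DCT of an 8x8 block (assumed convention: A * block * A^T)."""
    return matmul(matmul(A, block), AT)


def selector(offset):
    """Output rows offset..offset+3 take input rows 1, 3, 5, 7 (the bottom-field lines)."""
    f = [[0.0] * N for _ in range(N)]
    for r in range(4):
        f[offset + r][2 * r + 1] = 1.0
    return f


f1, f2 = selector(0), selector(4)  # f1 is an assumption; f2 matches the matrix in formula (1)


def bottom_field_dct(B1, B2):
    """Formula (1): combine two vertically adjacent frame-DCT blocks in the DCT domain."""
    return add(matmul(dct2(f1), B1), matmul(dct2(f2), B2))


# Deterministic pixel blocks for the check (arbitrary values).
b1 = [[(3 * r + 5 * c) % 17 for c in range(N)] for r in range(N)]
b2 = [[(7 * r + 2 * c) % 13 for c in range(N)] for r in range(N)]

# Pixel-domain reference: odd lines of the upper block, then odd lines of the lower block.
field_pixels = [b1[r] for r in (1, 3, 5, 7)] + [b2[r] for r in (1, 3, 5, 7)]

got = bottom_field_dct(dct2(b1), dct2(b2))
want = dct2(field_pixels)
max_err = max(abs(got[i][j] - want[i][j]) for i in range(N) for j in range(N))
print(max_err < 1e-9)
```

Because DCT(f_1) and DCT(f_2) are constant matrices, they can be computed once and reused for every macroblock, which is where the saving over a full inverse transform comes from.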
[0026] Next, these DCT coefficients are input to the spatial down-sample section 5, where
down-sampling in both the vertical and horizontal directions is performed on the DCT
coefficients. For example, down-sampling by one half is achieved as follows. Vertical
down-sampling is achieved simply by taking only one of the two fields. Horizontal down-sampling,
on the other hand, merges two horizontally adjacent blocks in the DCT domain by the following
formula.

[0027] 

[Equation 3]

[B_j^{bottom}]_{\text{columns } 4..7} \leftarrow 0 \quad (j = 1, 2)

B^{bottom} = B_1^{bottom} \cdot DCT(f_1^{T}) + B_2^{bottom} \cdot DCT(f_2^{T}) \qquad (3)

[0028] Here, the first formula of (3) expresses the operation of replacing the components in
columns 4 to 7 (that is, the right half of the block) of block B_j^bottom with 0. More generally,
by extending formula (3) to an arbitrary number N of horizontally adjacent blocks with the same
matrix operation, a 1/N down-sample can also be achieved. The same processing is also possible
in the vertical direction.
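
The horizontal half down-sample of formula (3) can be sketched in the same way. The sketch assumes the orthonormal convention DCT(X) = A X A^T, and it assumes that f_1 and f_2 are the same selection matrices as in formula (1), so that multiplying by their transposes on the right keeps columns 1, 3, 5, 7 of each block. Function names are hypothetical. The check compares the DCT-domain result with a pixel-domain reference built from the low-pass filtered blocks.

```python
import math

N = 8


def matmul(x, y):
    return [[sum(x[i][k] * y[k][j] for k in range(N)) for j in range(N)] for i in range(N)]


def transpose(x):
    return [list(row) for row in zip(*x)]


def add(x, y):
    return [[a + b for a, b in zip(rx, ry)] for rx, ry in zip(x, y)]


A = [[0.5 * (1 / math.sqrt(2) if m == 0 else 1.0) * math.cos(m * (n + 0.5) * math.pi / N)
      for n in range(N)] for m in range(N)]
AT = transpose(A)


def dct2(block):
    return matmul(matmul(A, block), AT)


def idct2(coeffs):
    # Used only to build the pixel-domain reference; the transcoder path never calls it.
    return matmul(matmul(AT, coeffs), A)


def selector(offset):
    f = [[0.0] * N for _ in range(N)]
    for r in range(4):
        f[offset + r][2 * r + 1] = 1.0
    return f


def zero_right_half(coeffs):
    """First line of formula (3): set horizontal-frequency columns 4..7 to zero."""
    return [[v if c < 4 else 0.0 for c, v in enumerate(row)] for row in coeffs]


# Constant matrices, computed once.
F1T = dct2(transpose(selector(0)))
F2T = dct2(transpose(selector(4)))


def merge_horizontal(B1, B2):
    """Second line of formula (3): merge two horizontally adjacent blocks into one."""
    return add(matmul(zero_right_half(B1), F1T), matmul(zero_right_half(B2), F2T))


b1 = [[(3 * r + 5 * c) % 17 for c in range(N)] for r in range(N)]
b2 = [[(7 * r + 2 * c) % 13 for c in range(N)] for r in range(N)]
B1, B2 = dct2(b1), dct2(b2)

# Pixel-domain reference: low-pass each block, then keep every second column.
l1, l2 = idct2(zero_right_half(B1)), idct2(zero_right_half(B2))
reference = [[l1[r][c] for c in (1, 3, 5, 7)] + [l2[r][c] for c in (1, 3, 5, 7)]
             for r in range(N)]

merged = merge_horizontal(B1, B2)
expected = dct2(reference)
max_err = max(abs(merged[i][j] - expected[i][j]) for i in range(N) for j in range(N))
print(max_err < 1e-9)
```

Zeroing columns 4 to 7 before merging acts as a low-pass filter, which limits aliasing when every second column is discarded.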

[0029] Returning to Drawing 1, the transform coefficients output from functional-block group A
are input to the output-side quantization section 6 and quantized again. The transform
coefficients quantized in the output-side quantization section 6 are then input to the
variable-length coding section 7 and variable-length coded.
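
Re-quantization can be illustrated with a deliberately simplified model. The sketch below assumes a uniform scalar quantizer with a single step size per block; real MPEG quantizers also use weighting matrices and dead zones, which are omitted here, and the down-sampling between the two stages is skipped. The function names and the step sizes 4 and 10 are illustrative assumptions.

```python
import math


def dequantize(levels, step):
    """Input-side inverse quantization: level -> coefficient."""
    return [[level * step for level in row] for row in levels]


def quantize(coeffs, step):
    """Output-side quantization, rounding half away from zero."""
    def q(c):
        return int(math.copysign(math.floor(abs(c) / step + 0.5), c))
    return [[q(c) for c in row] for row in coeffs]


input_levels = [[12, -5, 3, 1]]              # levels decoded from the input stream
coeffs = dequantize(input_levels, step=4)    # [[48, -20, 12, 4]]
output_levels = quantize(coeffs, step=10)    # coarser step for the lower bit rate
print(output_levels)  # [[5, -2, 1, 0]]
```

The coarser step yields smaller levels and more zeros, which is what lowers the bit rate after variable-length coding; the price is the re-quantization error (here 48 becomes 50 and 4 becomes 0 after reconstruction).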

[0030] Meanwhile, the input-side syntax data 9 extracted in the variable-length decoding section
2 is converted by the syntax transform section 10 to match the output stream, and is output as
output-side syntax data 11. In the variable-length coding section 7, this output-side syntax
data 11 is combined with the transform coefficients output from the output-side quantization
section 6, and finally the output video stream 8 is generated and output.

[0031] Drawing 2 is a block diagram of a transcoder according to another embodiment of this
invention, which has a drift error correction function. As in Drawing 1, the input to this
apparatus is the input video stream 1, compression-coded by a hybrid coding scheme combining
motion compensation and transform coding. The input video stream 1 is first variable-length
decoded in the variable-length decoding section 2. The motion vector information 13 of the input
stream and the other input-side syntax data 9 needed to generate the output stream are also
extracted. The picture type of each frame is identified, and when the frame rate needs to be
reduced, bidirectionally predicted frames (B-pictures) can be dropped here. The video stream with
the B-pictures dropped (consisting of I-pictures and P-pictures) is input to the input-side
inverse quantization section 3, where the transform coefficients of each block are inverse
quantized.

[0032] Next, the inverse-quantized transform coefficients of each block are input to
functional-block group A, which reduces the image size while the data remains in the transform
domain. Functional-block group A is divided into the pre-processing transform section 4 and the
spatial down-sample section 5. The pre-processing transform section 4 varies somewhat with the
kind of input video stream; its details are the same as in the case of Drawing 1.

[0033] Next, these DCT coefficients are input to the spatial down-sample section 5, where
down-sampling in both the vertical and horizontal directions is performed on the DCT
coefficients. The details are again the same as in the case of Drawing 1.

[0034] The transform coefficients 18 after scale conversion, with the correction signal 19 from
the feedback loop added, are then input to the output-side quantization section 6 and quantized
again. The output signal 20 of the output-side quantization section 6 branches. One branch is
input to the variable-length coding section 7 and variable-length coded; there it is combined
with the output-side syntax data 11, converted by the syntax transform section 10 to match the
output stream, and the output video stream 8 is generated and output.
[0035] The other branch of the output signal 20 is input to feedback loop B. It is first input to
the output-side inverse quantization section 17 and inverse quantized. The difference is then
taken between the coefficients 21 output from the output-side inverse quantization section 17 and
the transform coefficients 18 after scale conversion.

[0036] The difference signal obtained here is the error (drift error) signal produced by
re-quantization in the current frame. This difference signal is then added to the
motion-compensated prediction coefficients 22. The prediction coefficients 22 are obtained by
branching off the correction signal 19 generated and output by the frequency-domain motion
compensation section 25. Since they correspond to the error signal accumulated up to the previous
frame, the addition yields the total drift error accumulated up to the current frame, and this
error signal is stored in frame memories 23 and 24.
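
The behaviour of the feedback loop can be sketched for a single coefficient tracked over successive frames. This is a heavily simplified illustration, not the circuit of Drawing 2. It assumes zero motion, so that motion compensation reduces to reading the stored error back unchanged. It also assumes that the difference is taken as coefficient 18 (before correction) minus reconstructed coefficient 21, and that the quantizer is uniform. All function and variable names are hypothetical.

```python
import math


def quantize(value, step):
    """Uniform scalar quantizer, rounding half away from zero."""
    return int(math.copysign(math.floor(abs(value) / step + 0.5), value))


def requantize_sequence(coeffs, step, use_feedback):
    """Re-quantize one coefficient position over successive predicted frames."""
    memory = 0.0       # accumulated drift error (stands in for the frame memory)
    reconstructed = []
    for x in coeffs:                                    # x: coefficient 18 of this frame
        correction = memory if use_feedback else 0.0    # correction signal 19
        level = quantize(x + correction, step)          # output signal 20
        recon = level * step                            # coefficient 21
        diff = x - recon                                # error produced in this frame
        memory = diff + correction                      # total error up to this frame
        reconstructed.append(recon)
    return reconstructed


frames = [3.0] * 20   # small residual repeated in every frame
step = 10.0

plain = requantize_sequence(frames, step, use_feedback=False)
corrected = requantize_sequence(frames, step, use_feedback=True)

# With zero motion a decoder simply sums the residuals, so the drift is the
# difference between the ideal sum and the decoded sum.
drift_plain = sum(frames) - sum(plain)
drift_corrected = sum(frames) - sum(corrected)
print(drift_plain, drift_corrected)
```

Without the loop every residual is rounded to zero and the error grows with each frame; with the loop the accumulated error is carried forward and stays within half a quantization step.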

[0037] The motion vectors needed by the frequency-domain motion compensation section 25 are
obtained from the motion vector data 13, extracted beforehand from the input stream, by scaling
them to match the image size in the scale transform section 15. In addition, to make the
prediction image accurate, the reduced images obtained from the top field and from the bottom
field are stored in frame memories 23 and 24, and the frequency-domain motion compensation
section 25 uses both as reference images.
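
Scaling a motion vector to the reduced image size can be sketched as below. The rounding rule (half away from zero), the integer vector units and the example picture sizes are assumptions made for illustration; the text only states that vectors are scaled in accordance with the image size. The function names are hypothetical.

```python
def scale_component(value, out_size, in_size):
    """Scale one vector component by out_size / in_size, rounding half away from zero."""
    scaled = abs(value) * out_size
    magnitude = (2 * scaled + in_size) // (2 * in_size)
    return -magnitude if value < 0 else magnitude


def scale_motion_vector(mv, in_size, out_size):
    """mv, in_size and out_size are (horizontal, vertical) pairs."""
    return (scale_component(mv[0], out_size[0], in_size[0]),
            scale_component(mv[1], out_size[1], in_size[1]))


# Halving a 720x480 picture in both directions halves each vector component.
print(scale_motion_vector((6, -4), (720, 480), (360, 240)))  # (3, -2)
print(scale_motion_vector((5, -1), (720, 480), (360, 240)))  # (3, -1)
```

Integer arithmetic is used so that the rounding is explicit and does not depend on floating-point behaviour.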
[0038] In addition, the embodiments of this invention shown in Drawing 1 or Drawing 2 can be
distributed by recording a program for making a computer execute the processing procedure or
computational algorithm on a computer-readable recording medium, for example a floppy
(registered trademark) disk, memory card, MO, CD or DVD.
[0039] 

[Effect of the Invention] As described above, this invention makes it possible to convert
transform-coded frequency-domain transform coefficients into video data of a different kind
without ever applying an inverse transform to them. This has the effect of shortening the
transcoding processing time.

[0040] Moreover, adding the feedback loop for drift error correction has the effect of improving
image quality after transcoding, again without ever applying an inverse transform to the
transform-coded frequency-domain transform coefficients.



[Drawings] (Publication No. 2002-152755)

Legend: F/f Trans, Spatial Down, Q, VLC, Syntax Trans. Output stream: MPEG-4.

[Drawing 2] Block labels: VLD, Q^-1, F/f Trans, Spatial Down, Syntax Trans, Scale Trans, Q, VLC,
DCT-MC, FM. Reference numerals visible: 10, 13, 15, 19, 23, 25.